CONTENTS

1. Original Dataset Preview

2. Preview after Data Cleaing and Preprocessing

3. Introduction and Problem Statement

4. Section 1 - Demographic Analysis Visualizations

5. Section 2 - Economic AnalysisVisualization

6. Correlated Analysis

7. Conclusion

8. References

INTRODUCTION AND PROBLEM SATATEMENT

Suicide is a major public health issue that has affected communities all over the world. Despite technological and medical advances, the number of suicides has steadily increased, making it one of the leading causes of death worldwide. With over 800,000 suicides recorded each year, we must understand the underlying causes of this phenomenon and take action to reduce its occurrence.

Depression is frequently cited as one of the leading causes of suicide, but it is not the only one. Suicide rates are increasing across all age groups, genders, and generations, indicating a complicated and multifaceted problem. To gain a better understanding of suicide and its contributing factors, data from various countries must be analyzed and compared using various parameters such as GDP per year, GDP per capita, total population, gender, and more.

This dataset provides a comprehensive overview of suicide rates in various countries and a rich source of information for analyzing trends and patterns related to this issue. The dataset contains 12 distinct columns that contain data such as the country, year, gender, age, total number of suicides, total population, country-year, GDP per year, HDI for the year, GDP per capita, and generation. With such a large dataset, we can delve deeper into the problem and gain valuable insights into the factors that are causing suicides.

We can better understand the “Suicide Rates Overview 1985 to 2016” for different countries, age groups, and years. The study and report have been divided into three distinct sections that are highlighted. The opening part talks about “Demographics in terms of Suicide”. Additionally, section 2 discusses “Economic Analysis based on Suicide Cases”, and the final part highlights “Correlated Analysis”

This analysis will shed light on the patterns and trends in suicide rates across different countries and demographics, providing valuable insights into the complex issue of suicide.

Original Data Preview

##   country year    sex         age suicides_no population suicides.100k.pop
## 1 Albania 1987   male 15-24 years          21     312900              6.71
## 2 Albania 1987   male 35-54 years          16     308000              5.19
## 3 Albania 1987 female 15-24 years          14     289700              4.83
## 4 Albania 1987   male   75+ years           1      21800              4.59
## 5 Albania 1987   male 25-34 years           9     274300              3.28
## 6 Albania 1987 female   75+ years           1      35600              2.81
##   country.year HDI.for.year gdp_for_year.... gdp_per_capita....      generation
## 1  Albania1987           NA    2,156,624,900                796    Generation X
## 2  Albania1987           NA    2,156,624,900                796          Silent
## 3  Albania1987           NA    2,156,624,900                796    Generation X
## 4  Albania1987           NA    2,156,624,900                796 G.I. Generation
## 5  Albania1987           NA    2,156,624,900                796         Boomers
## 6  Albania1987           NA    2,156,624,900                796 G.I. Generation

Preview after Data Cleaning and Preprocessing

##   Country Year Gender         Age Number_of_Suicides Population Suicide_Rate
## 1 Albania 1987   male 15-24 years                 21     312900         6.71
## 2 Albania 1987   male 35-54 years                 16     308000         5.19
## 3 Albania 1987 female 15-24 years                 14     289700         4.83
## 4 Albania 1987   male   75+ years                  1      21800         4.59
## 5 Albania 1987   male 25-34 years                  9     274300         3.28
## 6 Albania 1987 female   75+ years                  1      35600         2.81
##   HDI_per_Year GDP_per_Year GDP_per_Capita      Generation
## 1           NA   2156624900            796    Generation X
## 2           NA   2156624900            796          Silent
## 3           NA   2156624900            796    Generation X
## 4           NA   2156624900            796 G.I. Generation
## 5           NA   2156624900            796         Boomers
## 6           NA   2156624900            796 G.I. Generation

Section 1: Demographic Analysis Visualizations

In this section, we will examine the suicide rate in relation to various demographic factors such as Age, Gender, Country, Year, Generation and Population. We can create charts or plots using advanvced visualization techniques to compare the suicide rate with each demographic, which will allow us to see any patterns or differences in the data. This will help us understand how these factors influence the suicide rate and identify any demographics that may require targeted interventions. We have divided this section to address multiple questions to draw conclusions based on each visualization.

1. Which year saw the highest number of suicides?

In 1999 and 2003, the highest number of suicides were recorded, with a total count of 256119 and 256079, respectively. The number of suicides increased every year from 1985 to 1996 and again from 1997, with a brief decrease in 1997. Further analysis is needed to understand the gender breakdown of these suicide cases.

2. How does age impact the number of suicides on a yearly basis?

The highest rate of suicides appears to occur in the age group of 35-54 years.

3. What is the breakdown of suicides and their rates by gender and age group?

Regardless of the year, the data shows that male suicides are approximately three times higher than female suicides. Furthermore, the suicide rate rises with age, indicating that age is a significant factor in determining suicide risk. This highlights the importance of targeted interventions to address the unique challenges that different demographic groups face, particularly older men who are at the highest risk of suicide. Understanding the relationship between gender, age, and suicide rate can aid in the development of effective prevention strategies and aid in the reduction of suicides.

4. How does age group and gender impact the number of suicides over time in a specific country ( United States )?

The trend in suicides in the United States reveals a persistent upward trajectory over the years and a correlation with age, with higher rates of suicide as age increases.

5. Do different generations exhibit variations in their suicide rates?

Stacked Bar Chart The generations play a significant role in the suicide cases, and each generation has contributed to the suicide cases over the years. The G. I Generation and Generation Z have been the prominent generations with a high number of suicides reported. The Boomers generation has made a significant count in the suicide cases, but a year of 2010 has been observed with no suicide cases reported for this generation.

Heatmap Suicide rates vary across different generations, as seen in the heatmap. G.I Generation has a higher intensity compared to other generations, but every generation has contributed to the overall suicide cases. Notably, G.I Generation and Generation Z have gaps in their suicide cases during certain periods.

6. How do countries differ in their suicide rates and demographics of those affected?

The graph presents the highest number of suicides by country. Japan, the Russian Federation, and the United States have the most reported cases.

7. What are the Top and Bottom 10 Countries with the highest and lowest Average Suicide Rate?

The top 10 countries with the highest average suicide rates were analyzed using a donut chart, with Lithuana having the highest rate at 12.8%. Sri Lanka followed closely at 11.2% and the Russian Federation at 11%. These results showcase the severity of the issue in these countries and the need for a closer examination of the underlying factors contributing to the high suicide rates.

On the other hand, the bottom 10 countries with the lowest average suicide rates were also analyzed using a donut chart. Interestingly, there were no recorded suicide rates in countries such as Dominica and Saint Kitts. Jamaica, on the other hand, had the lowest average suicide rate among these countries. These insights highlight the need for continued monitoring of suicide trends and the factors contributing to them, even in countries with low suicide rates, to ensure the well-being of the population.

8. Suicide rates for the top 10 countries

The box plot of suicide rates for the top 10 countries reveals that the distribution of suicide rates is quite varied. Republic of Korea has a high number of outliers, indicating that there is a significant amount of variability in the suicide rates for this country. On the other hand, the Russian Federation has the highest median suicide rate, with a median of 19.13. This highlights that the suicide rate in this country is significantly higher compared to other countries. On the other hand, the suicide rate is the lowest in Brazil, with a median of 2.80. This suggests that the suicide rate in Brazil is relatively low compared to other countries.

9. What is the pattern of suicide rates by age and gender in the top 6 countries with the highest suicide rates?

The suicide rates vary greatly across countries. In some countries, suicide is a leading cause of death, while in others it is much less prevalent. The countries with the highest number of suicides include Belarus, Hungary, Kazakhstan, Lithuania, Russia, and Sri Lanka, with a higher number of male suicides compared to female. The age group with the highest number of suicides is 35 to 54 years. In Kazakhstan, the highest suicide rate is in the 5 to 14 years age group, while in Sri Lanka, females in the same age group have a higher rate of suicides than males.

10. What is the pattern of suicide rates by age and gender in the bottom 6 countries with the lowest suicide rates?

On the other hand, countries with the lowest suicide rates include Antigua and Barbuda, Cabo Verde, Dominica, Grenada, Macau, Maldives, Oman, Saint Kitts and Nevis, and San Marino. Saint Kitts and Dominica have zero reported suicides, while Oman and San Marino have no reported suicides among females. Maldives also has zero suicides reported in the 55 to 74 years age group. The population size and demographic factors play a significant role in determining the suicide rate of a country.

Section 2: Economic Analysis based on Suicide Cases

In this section, we will examine the relationship between the suicide rate and economic factors, such as GDP per capita and HDI. We can create scatter plots to show these relationships, allowing us to visualize any correlations between these variables and the suicide rate. This will help us better understand the impact that a country’s standard of living and human development have on its suicide rate and provide valuable information for developing interventions to prevent suicide.

1. What is the distribution of GDP per capita for the top 10 countries?

The analysis shows a significant difference in the distribution of GDP per capita among the top 10 countries. Luxembourg stands out as the country with the highest GDP per capita, while the United States has the lowest GDP per capita among the top 10 countries. This indicates that there is a wide range of economic development among these countries, and that the level of economic prosperity can greatly impact the standard of living and overall well-being of the population.

2. What is the correlation between GDP per capita and suicide rates in different countries?

Considering top 10 countries with highest Suicide Rate

There appears to be a correlation between the GDP per capita of a country and its suicide rate. Countries with higher GDP per capita tend to have lower suicide rates, such as Slovenia. On the other hand, countries with lower GDP per capita, such as Sri Lanka, tend to have moderate to high suicide rates. An interesting exception is seen in Ukraine, which has the lowest suicide rate but also the lowest GDP per capita.

Considering Bottom 10 countries with least Suicide Rate

The correlation between a country’s GDP per capita and its suicide rate is apparent. Dominica has both low GDP per capita and low suicide rate, while United Arab Emirates has high GDP per capita but also high suicide rate. Oman presents a moderate balance between the two variables.

Based on the above insights, it can be concluded that there is not a clear relationship between the GDP per capita and the suicide rates of countries. Some countries have high GDP per capita and low suicide rates, such as Slovenia and the United Kingdom. Other countries have low GDP per capita and high suicide rates. There are also countries that fall in between with moderate rates of both variables, such as Oman. This indicates that there may be other factors that play a role in determining suicide rates, and that a high GDP per capita does not necessarily equate to low suicide rates and vice versa.

3. What is the correlation between GDP per capita and Total Number of Suicides in different countries?

Considering Top 10 countries with highest Suicides

It can be concluded from the above observations that there is no clear pattern that can be drawn between the GDP per capita and the number of suicides in a country. In some countries, like Brazil, Poland, and South Korea, there is an increase in the number of suicides with an increase in GDP per capita, while in other countries, such as France, Germany, Russia, Ukraine, and the United States, the number of suicides decreases with an increase in GDP per capita.

Considering Bottom 10 countries with least Suicides

Due to the scattered data points and limited information on suicides and GDP per capita for the countries, it is difficult to draw any meaningful conclusions from this information.

The relationship between GDP per capita and the total number of suicides is complex and can vary greatly between countries. In some cases, an increase in GDP per capita can lead to a decrease in suicides, while in other countries an increase in GDP per capita can lead to an increase in suicides. The relationship between these two variables is not clear and likely depends on various social, cultural, and economic factors. Thus, it is important to consider multiple factors and not rely solely on GDP per capita when examining the issue of suicide.

4. What are the top 10 countries with the highest GDP per capita on a yearly basis?

GDP_per_Capita VS Year : Through this plot, it is apparent that the evolution of GDP per capita differs greatly among countries. In the case of Ukraine, the GDP per capita remains largely unchanged and remains at a low level. On the other hand, the United States experiences a significant increase in GDP per capita over time, with a noticeable leap at certain points. Meanwhile, Japan’s GDP per capita exhibits a fluctuating pattern, with alternating periods of increase and decrease.

Suicides VS Year : This plot provides a visual representation of the trend of suicides in different countries over time. In the case of Japan, there is a noticeable decrease in suicides from 1986 to 1997, followed by an increase from 1998 onwards. Meanwhile, in the United States, the trend seems to indicate a steady increase in suicides starting from 2004. Overall, the plot highlights the fluctuations in the number of suicides in different countries and allows for a better understanding of the trends and patterns in each country.

In conclusion, the evolution of GDP per capita and suicide rate differs greatly among countries, as shown in the plots. While the GDP per capita in the United States has increased significantly over time, Japan’s GDP per capita has exhibited a fluctuating pattern with periods of increase and decrease. Similarly, the suicide rate in Japan has fluctuated over time, with a noticeable drop in suicides from 1986 to 1997 and an increase again from 1998 onwards. In contrast, the suicide rate in the United States has had a noticeable incline from 2004 onwards. These insights highlight the complexity of economic and social factors that contribute to the well-being of individuals and societies.

Correlation Analysis (Integrating the above Sections)

Correlation Matrix

The correlation heatmap provides an insightful representation of the relationship between different numerical variables in the dataset such as year, number of suicides, population, suicide rate, GDP per Year, and GDP per Capita. Through this visualization, it became evident that there is a strong positive correlation between population and number of suicides, which is reflected in the high correlation with GDP per Year. Additionally, the heatmap highlights a clear negative correlation between the number of suicides and the year, indicating a decline in suicide cases over time.

CONCLUSION

In our analysis, we have primarily focused on exploring the impact of demographic factors on the trend of suicides across various countries (more focus on Section 1). Through the creation of visualizations using demographic information from the dataset, we aimed to shed light on how factors such as age, gender, etc. significantly influence the rate of suicides.

This project aimed at exploring the various factors that contribute to suicide cases and the suicide rate. The first significant insight that the project has discovered is that age and generation play a crucial role in determining the suicide cases. The number of suicides increases with age, and the trend of suicides has been observed to increase over time. Additionally, the majority of suicides are committed by males, who account for nearly 70% of all suicides across different age groups.

The second insight that the project has discovered is that the suicide rate is closely tied to the total population of a country. A country with a low population and a relatively high number of suicides will have a higher suicide rate compared to a country with a larger population but fewer suicides. For example, even though both the USA and Maldives have the same number of suicides, the suicide rate in Maldives would be higher since it has a smaller population.

The third key learning from this project is the relationship between a country’s GDP and the suicide rate. The top 10 countries in terms of suicide rate and GDP per capita were analyzed in this project, and it was observed that the countries with higher GDP per capita had a relatively lower suicide rate.

In conclusion, the project has explored several critical factors that contribute to suicide cases and the suicide rate. By utilizing various visualizations, the project has provided insights into the trend of suicides over the years, the contribution of different age groups, the relationship between population and suicide rate, and the connection between GDP per capita and suicide rate. These insights can help policymakers and healthcare professionals to better understand the factors that contribute to suicides and to design effective intervention strategies to reduce suicide rates in the future.

REFERENCES

Dataset - Suicide Rates Overview 1985 to 2016 - “https://www.kaggle.com/datasets/russellyates88/suicide-rates-overview-1985-to-2016” Visualization - http://r-statistics.co/Top50-Ggplot2-Visualizations-MasterList-R-Code.html Colour Pallette - https://earlglynn.github.io/RNotes/package/RColorBrewer/index.html